Discriminant wavelet basis construction for speech recognition
نویسندگان
چکیده
In this paper, a new feature extraction methodology based on Wavelet Transforms is examined, which unlike some conventional parameterisation techniques, is flexible enough to cope with the broadly differing characteristics of typical speech signals. A training phase is involved during which the final classifier is invoked to associate a cost function (a proxy for misclassification) with a given resolution. The sub spaces are then searched and pruned to provide a Wavelet Basis best suited to the classification problem. Comparative results are given illustrating some improvement over the Short-Time Fourier Transform using two differing subclasses of speech.
منابع مشابه
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملWavelet Based Recognition Using Model Theory for Feature Selection
An increase in accuracy and reduction in computational complexity of the common wavelet-based target recognition techniques can be achieved by using interpretable features for recognition. In this work, the Best Discrimination Basis Algorithm (BDBA) is applied to select the most discriminant complete orthonormal wavelet basis for recognition purposes. The BDBA uses a relative entropy criterion ...
متن کاملApplication of Wavelet Packet Transform in Pattern Recognition of Near-IR Data
The wavelet packet transform is studied as a tool for improving pattern recognition based on near-infrared spectra. Application to the preprocessing of the spectra improves the classification when compared to using either the standard normal variate method or no pretreatment at all. Selecting features from a local discriminant basis instead of from a full decomposition does not improve the resu...
متن کاملAn analog VLSI architecture for auditory based feature extraction
We have developed a low power analog VLSI chip for real time signal processing motivated by the principles of human auditory system. A analog cochlear lter-bank (which is implemented on the chip) decomposes the input audio signal into several frequency bands that have almost equal bandwidth on a log scale. This step is thus similar to computing the wavelet transform. The chip then computes sign...
متن کاملSpeech Emotion Recognition Based on Deep Belief Networks and Wavelet Packet Cepstral Coefficients
A wavelet packet based adaptive filter-bank construction combined with Deep Belief Network(DBN) feature learning method is proposed for speech signal processing in this paper. On this basis, a set of acoustic features are extracted for speech emotion recognition, namely Coiflet Wavelet Packet Cepstral Coefficients (CWPCC). CWPCC extends the conventional MelFrequency Cepstral Coefficients (MFCC)...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998